Human betacoronaviruses OC43 and HKU1 are endemic respiratory
pathogens and, while related, originated from independent zoonotic
introductions. OC43 is in fact a host-range variant of the species
Betacoronavirus-1, and more closely related to bovine coronavirus
(BCoV), its presumptive ancestor, and porcine hemagglutinating
encephalomyelitis virus (PHEV). The β1-coronaviruses (β1CoVs) and
HKU1 employ glycan-based receptors carrying 9-O-acetylated sialic
acid (9-O-Ac-Sia). Receptor binding is mediated by spike protein S,
the main determinant of coronavirus host specificity. For BCoV, a
crystal structure for the receptor-binding domain S1A is available
and for HKU1 a cryoelectron microscopy structure of the complete S
ectodomain. However, the location of the receptor-binding site (RBS),
arguably the single most important piece of information, is unknown.
Here we solved the 3.0-Å crystal structure of PHEV S1A. We then took
a comparative structural analysis approach to map the β1CoV S RBS,
using the general design of 9-O-Ac-Sia-binding sites as blueprint,
backed up by automated ligand docking, structure-guided mutagenesis
of OC43, BCoV, and PHEV S1A, and infectivity assays with
BCoV-S-pseudotyped vesicular stomatitis viruses. The RBS is not
exclusive to OC43 and related animal viruses, but is apparently
conserved and functional also in HKU1 S1A. The binding affinity of
the HKU1 S RBS toward short sialoglycans is significantly lower than
that of OC43, which we attribute to differences in local architecture
and accessibility, and which may be indicative of differences between
the two viruses in receptor fine-specificity. Our findings challenge
reports that would map the OC43 RBS elsewhere in S1A and that of HKU1
in domain S1B.

Coronaviruses (CoVs; order Nidovirales, family Coronaviridae) are
enveloped positive-strand RNA viruses of mammals and birds. So far,
four coronaviruses of zoonotic origin are known to have successfully
breached the species barrier to become true human pathogens (1-6).
These viruses (NL63, 229E, HKU1, and OC43) are persistently
maintained in the human population through continuous circulation.
Remarkably, the latter two both belong to a single minor clade,
“lineage A,” in the genus Betacoronavirus. Although generally
associated with common colds, HKU1 and OC43 may cause severe and
sometimes fatal pulmonary infections in the frail (7, 8), and in rare
instances, OC43 may cause lethal encephalitis (9). OC43 and HKU1 are
distinct viruses that entered the human population independently to
seemingly follow convergent evolutionary trajectories in their
adaptation to the novel host (10). OC43 is in fact more closely
related to coronaviruses of ruminants, horses, dogs, rabbits, and
swine, with which it has been united in a single species,
Betacoronavirus-1.

Lineage A betacoronaviruses like HKU1 and OC43 differ from other CoVs
in that their virions possess two types of surface projections, both
of which are involved in attachment: large 20-nm peplomers or
“spikes” that are very much a CoV hallmark and composed of
homotrimers of spike (S) protein, and 8-nm protrusions, unique to
this clade, composed of the homodimeric hemagglutinin-esterase (HE).
S is central to viral entry and the key determinant of host and
tissue tropism (11). It mediates binding to cell-surface receptors
and, upon uptake of the virion by the host cell, fusion between the
viral envelope and the limiting endosomal membrane (12). In the case
of HKU1 and β1 coronaviruses (β1CoVs), OC43 included, S binds to
sugar-based receptor determinants, specifically to 9-O-acetylated
sialic acids (9-O-Ac-Sias) attached


Significance 


Human coronaviruses OC43 and HKU1 are related, yet distinct
respiratory pathogens, associated with common colds, but also with
severe disease in the frail. Both viruses employ sialoglycan-based
receptors with 9-O-acetylated sialic acid (9-O-Ac-Sia) as key
component. Here, we identify the 9-O-Ac-Sia-specific receptor-binding
site of OC43 S and demonstrate it to be conserved and functional in
HKU1. The considerable difference in receptor-binding affinity
between OC43 and HKU1 S, attributable to differences in local
architecture and receptor-binding site accessibility, is suggestive
of differences between OC43 and HKU1 in their adaptation to the human
sialome. The data will enable studies into the evolution and
pathobiology of OC43 and HKU1 and open new avenues toward
prophylactic and therapeutic intervention.
as terminal residues to glycan chains on glycoproteins and lipids
(13, 14). HE, a sialate-O-acetylesterase with an appended
9-O-Ac-Sia-specific lectin domain, acts as a receptor-destroying
enzyme (15). During preattachment, the receptor-destroying enzyme
activity of HE averts irreversible binding of virions to the decoy
receptors that are omnipresent in the extracellular environment.
Furthermore, at the conclusion of the replication cycle, HE-mediated
destruction of intracellular and cell-surface receptors facilitates
the release of viral progeny from the infected cell (16). Remarkably,
the HEs of OC43 and HKU1 lost their ability to bind 9-O-Ac-Sias,
because their lectin domain (functional in all other HEs studied so
far) was rendered inactive (10). In consequence, the dynamics and
extent of virion-mediated destruction of clustered receptor
populations were altered, presumably to match the sialome of the
human respiratory tract and to optimize infection and/or
transmission. Whether adaptation to the human host also entailed
adaptations in S is not known and as yet cannot be assessed, because
for HKU1 the S receptor-binding site (RBS) has not been identified
and the β1CoV S RBS has not been established with certainty (17).

Crystal structure analysis of orthomyxo-, toro-, and coronavirus HEs
complexed to receptor/substrate analogs identified 9-O-Ac-Sia binding
sites, yielding exquisite insight into the architecture of these
sites and into the general principles of ligand/substrate recognition
at the atomic level (18-24). Recently, cryoelectron microscopy
(cryo-EM) structures were reported for several S proteins, including
that of HKU1 (25-29). The findings have greatly increased our
understanding of the overall quaternary structure and function of the
S homotrimers. Among others, the structures revealed how in
betacoronavirus spikes the N-terminal S1 subunit of each S monomer
folds into four individual domains, designated A through D as
numbered from the N terminus (Fig. 1A), of which domains A and B may
function in receptor binding (11). The spikes of β1CoVs OC43 and BCoV
bind to O-acetylated sialic acid through domain A (S1A), as
determined by in vitro binding assays (30). There is limited
structural information, however, on how the spikes of HKU1 and the
β1CoVs bind their ligands. The apo-structure of the BCoV S1A lectin
domain was solved, but attempts to also solve the holo-structure
reportedly failed (17). Based on the galectin-like fold of the S1A
domain and mutational analysis, the RBS was predicted (17). Although
this model remains to be confirmed, it has been widely accepted by
the field (27, 28, 31-33). Surprisingly, HKU1 S was recently reported
to bind to its receptor via a domain other than S1A (31). Despite the
similarity of HKU1 domain A to that of BCoV and OC43, binding of HKU1
S1A to 9-O-Ac-Sia was reportedly not detectable (30, 31). Prompted by
these observations and the fact that the predicted RBS in β1CoV S1A
in its design and architecture bears no resemblance to other
9-O-Ac-Sia binding sites, we sought to test the published model. The
results led us to look for alternative binding sites in the β1CoV S1A
domain through comparative structural analysis and in silico modeling
using the general design of HE 9-O-Ac-Sia binding sites as a
blueprint, backed up by structure-guided mutagenesis. Our findings
show that the actual S1A RBS in β1CoV S maps elsewhere than currently
believed. Moreover, we demonstrate that the newly proposed site is
not exclusive to β1CoVs, but in fact conserved and functional also in
the S1A domain of HKU1.


Results and Discussion 


The S1A RBS Is Located Elsewhere than Currently Believed. To test the
validity of the current model, we measured the effect of
substitutions in the proposed RBS using S1A-Fc fusion proteins. The
binding properties of the mutated proteins were studied by
hemagglutination assay (HAA) with rat erythrocytes and by solid-phase
lectin-binding enzyme-linked immune assay (sp-LBA) with bovine
submaxillary mucin (BSM) as ligand. These assays are complementary:
HAA is the more sensitive of the two,
Fig. 1. S1A domains of β1CoV host range variants all bind 9-O-Ac-Sias,
but with different affinities. (A) Schematic representation of β1CoV
spike protein with subunits S1 and S2, and S1 domains A through D
indicated. Residue numbering is based on the OC43 strain ATCC/VR-759
spike protein (GenBank: AAT84354.1), with domain boundaries based on
the MHV S structure (26). CT, cytoplasmic tail; SP, signal peptide;
TM, transmembrane domain. (B) S1A β1CoV variants differ in
9-O-Ac-Sia-binding affinity. (Left) Conventional HAA with rat
erythrocytes (diluted to a final concentration of 0.25% in PBS, 0.1%
BSA) and twofold serial dilutions of S1A-Fc fusion proteins of β1CoV
variants BCoV-Mebus, OC43-ATCC and PHEV-UU. S1A-Fc of MHV-A59
(starting at 25 ng/L) was included as a negative control (56). HA was
assessed after 2-h incubation at 4 °C. Wells scored positive for HA
are encircled. HAAs were repeated at least three times.
Representative experiments are shown. (Right) S1A-Fc-mediated HA is
9-O-Ac-Sia-dependent. HAAs were performed as above, but now with rat
erythrocytes, depleted for 9-O-Ac-Sias by prior
sialate-O-acetylesterase treatment, as in ref. 47. (C) Differences in
S1A-mediated 9-O-Ac-Sia-binding affinity among β1CoV variant S1A
proteins as demonstrated by sp-LBA. S1A-Fc fusion proteins (twofold
serial dilutions, starting at 12.5 ng/L) were compared by sp-LBA for
relative binding to BSM at 37 °C. MHV S1A was included as a negative
control. Experiments were performed at least three times, each time
in triplicate, with each data point representing the average of the
independent mean values. Mean ± SDs were less than 10%; error bars
omitted for esthetical reasons. (D) S1A-Fc binding to BSM is
9-O-Ac-Sia dependent. BSM was specifically depleted for
sialate-9-O-acetyl moieties by on-the-plate sialate-O-acetylesterase
treatment for 2 h with twofold serial dilutions of soluble
hemagglutinin-esterase, as in refs. 10 and 47, starting at 75 U/L.
Receptor destruction was assessed by sp-LBA with fixed concentrations
of BCoV and OC43 S1A-Fc (0.3 ng/L and 1.2 ng/L, respectively).


while the results of sp-LBA more precisely reflect differences in
binding-site affinity (10). HAA and sp-LBA performed with S1A of BCoV
strain Mebus confirmed the binding to 9-O-Ac-Sias (Fig. 1 B-D). For
comparison, the S1A domains of related β1CoVs OC43 strain ATCC and
porcine hemagglutinating encephalomyelitis virus (PHEV) strain UU
were included. Again, 9-O-Ac-Sia-dependent binding was observed, but
the affinity of the S1A domain of OC43 was ~32-fold lower than that
of BCoV as measured by sp-LBA and that of PHEV was lower still
(~1,450-fold). Substitution in PHEV and OC43 S1A of Ala for Tyr162,
Glu182, Trp184, and His185 (Fig. 2A), residues deemed critical for
binding of BCoV S1A (17), indeed resulted in decreased binding (Fig.
2 B and C), but importantly and as in the original report, none of
the mutations gave complete loss-of-function. The strongest effect
was observed for Trp184, but its substitution in the context of OC43
S1A merely reduced binding affinity to that of wild-type PHEV S1A. In
a reverse approach, we attempted to identify substitutions
Fig. 2. Evidence from structure-guided mutagenesis against site A
being the S1A RBS. (A, Left) Cartoon representation of the crystal
structure of the BCoV S1A domain as determined by Peng et al. (17)
(PDB ID code 4H14). β-Sheets colored green, the α-helix colored red,
3₁₀-helices colored blue, and side chains of site A residues,
supposedly critical for 9-O-Ac-Sia binding (17), indicated in sticks
and colored orange. (Right) Site A close-up in surface representation
with side chains of residues, supposedly critical for 9-O-Ac-Sia
binding, indicated as orange sticks. Side chains of residues that
differ between BCoV Mebus and OC43 ATCC S1A are indicated in cyan.
(B) Substitutions of site A residues in PHEV and OC43 S1A-Fc result
in partial, but not complete loss of RBS binding affinity. In PHEV
and OC43 S1A-Fc, orthologs of BCoV Tyr162, Glu182, Trp184, and His185
were substituted by Ala (see SI Appendix, Table S1 for residue
numbering). Mutant proteins were compared with parental (wild-type)
S1A with maximum binding of BCoV S1A-Fc set at 100%. sp-LBA was as in
Fig. 1C, but with data points representing means of independent
duplicate experiments. (C) Residual binding of PHEV and OC43 S1A-Fc
mutants with Ala substitutions of site A residues as detected by HAA,
performed as in Fig. 1B. (D) Substitutions of OC43 site A residues by
BCoV orthologs do not increase RBS binding affinity as tested by
sp-LBA and (E) by HAA.
that result in gain-of-function of OC43 S1A (i.e., mutations that
would raise its binding to that of BCoV S1A). To this end, we
systematically replaced residues within or proximal to the proposed
binding site in OC43 S1A by their BCoV orthologs (Fig. 2A; see also
SI Appendix, Fig. S1). Individual substitutions Arg'His, Lys'®'Val,
Leu'®°Trp, and Ile'*°Thr (Fig. 2D), however, did not increase binding
affinity, and neither did the replacement of residues 147-151
(Ser-Thr-Gln-Asp-Gly) by Leu, which resulted not only in a large
deletion but also in the removal of an N-glycosylation site. In fact,
an OC43 S1A mutant, in which we combined all substitutions and thus
essentially reconstructed the proposed BCoV S1A RBS in the OC43
background, did not differ in its binding affinity from the parental
wild-type OC43 protein as measured by HAA and sp-LBA (Fig. 2D). From
the combined results, we conclude that the actual RBS is located
elsewhere. In further support, the RBS proposed by Peng et al. (17),
henceforth referred to as “site A,” does not conform to the typical
anatomy of O-Ac-Sia binding sites (18-24).
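
The fold differences quoted for the binding assays rest on twofold serial dilutions, so every dilution step corresponds to a factor of two. The following minimal sketch converts between dilution steps and fold differences. It assumes that two proteins are compared by the lowest concentration at which each still gives a positive signal, which is an illustrative read-out and not necessarily how the reported values were derived; all function names are hypothetical.

```python
import math

def dilution_series(start, steps, factor=2.0):
    """Return `steps` concentrations, highest first, each `factor`-fold lower."""
    return [start / factor**i for i in range(steps)]

def steps_for_fold(fold, factor=2.0):
    """Number of serial dilution steps equivalent to a given fold difference."""
    return math.log2(fold) / math.log2(factor)

def fold_from_endpoints(conc_strong, conc_weak):
    """Fold difference between two proteins, given the lowest concentration
    at which each still scores positive (assumed read-out, see lead-in)."""
    return conc_weak / conc_strong

if __name__ == "__main__":
    # First wells of a twofold series starting at 12.5 (units as in Fig. 1C).
    print(dilution_series(12.5, 4))
    # A ~32-fold difference equals five twofold steps.
    print(steps_for_fold(32))
    # A ~1,450-fold difference falls between 10 and 11 twofold steps.
    print(steps_for_fold(1450))
```

Seen this way, the ~32-fold gap between OC43 and BCoV S1A spans five wells of a twofold series, whereas the ~1,450-fold gap for PHEV spans more than ten.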


Crystallization Attempts to Identify the S1A RBS. The most obvious
and direct approach to identify the RBS would be by crystallographic
analysis of an S1A-ligand complex. In their report, Peng et al. (17)
stated that all efforts to determine the S1A holo-structure had been
unsuccessful. We considered that the complications the authors
encountered might be related to crystal packing, and therefore opted
to perform crystallization trials not with BCoV or OC43 S1A, but with
that of PHEV. Because its sequence varies from that of BCoV S1A by
22%, we hoped that crystals of different packing topology would be
produced, as was indeed the case. Crystals formed under a variety of
conditions, but all with space group P3,21 with two S1A molecules per
asymmetric unit. These crystals consistently disintegrated within
seconds during soaking attempts with receptor analog
methyl-5-N-acetyl-4,9-di-O-acetyl-α-neuraminoside. Crystals
flash-frozen immediately upon ligand addition showed poor diffraction
that did not extend beyond 10-Å resolution. The observations are
suggestive of ligand binding interfering with crystal stability (SI
Appendix, Fig. S2).

As an alternative, we performed extensive cocrystallization
screenings with PHEV S1A, both with native and EndoHf-treated protein
samples, together with methyl-5,9-di-N-acetyl-α-neuraminoside
(Neu5,9Ac2α2Me), a receptor analog chemically more stable than the
9-O-acetylated compound (22, 34). These attempts also remained
unsuccessful as no crystals were formed. In a final effort to obtain
crystals of altered packing that perchance would be compatible with
ligand soaking, each of the four N-glycosylation sites in PHEV S1A
was systematically removed, separately and in combination, by
Asn-to-Gln substitutions. However, the resulting proteins were prone
to aggregation and hence unsuitable for crystallization.

The diffraction data obtained for noncomplexed PHEV S1A eventually
allowed us to solve the apo-structure to 3.0-Å resolution (PDB ID
6QFY; SI Appendix, Fig. S1). While the results do not provide direct
clues to the location of the β1CoV S RBS, they do permit a
side-by-side comparison of the S1A domains of two divergent β1CoVs.
The difficulties met by us and others to identify the RBS by
crystallography led us to switch strategy and to follow an
alternative approach based on comparative structural analysis and
visual inspection of S1A domains using the general design of HE
9-O-Ac-Sia binding sites as a query.


An Alternative RBS Candidate Identified by Comparative Structural
Analysis. As explicated by Neu et al. (35), structural comparison of
viral attachment proteins in complex with their sialoglycan-based
ligands may help to define common parameters of recognition, and in
turn these “rules of engagement” may be used to predict the location
of Sia-binding sites in viral proteins for which such structural
information is still lacking. Studies by us and others on corona-,
toro-, and orthomyxoviral HE(F) proteins provide a wealth of
structural data on how proteins recognize O-Ac-Sias (18-24). The
O-acetyl Sia binding sites as they occur in HE lectin domains and
esterase catalytic sites, despite major differences in structure and
composition, conform to a common design in which ligand/substrate
recognition is essentially based on shape complementarity and
hydrophobic interactions, supported by protein-sugar hydrogen bonding
involving characteristic Sia functions, such as the glycerol side
chain, the 5-N-acyl moiety and, very often, the carboxylate.
Typically, the critical O-acetyl moiety docks into a deep hydrophobic
pocket (P1) and the 5-N-acyl in an adjacent hydrophobic
pocket/depression (P2). P1 and P2, ~7 Å apart in 9-O-Ac-Sia RBSs, are
separated by an aromatic side chain, placed such that in the bound
state the side chain intercalates between the O- and N-acyl groups,
often with the ring structure positioned to allow for CH/π
interactions with the O-acetyl-methyl moiety (18-20, 22-24). Site A,
previously proposed as the BCoV S1A RBS (17), clearly does not
conform to this signature. Visual inspection of the PHEV and BCoV S1A
structures, however, identified a region distal from site A that in
many aspects does resemble a typical O-Ac-Sia binding site, with two
hydrophobic pockets separated by the Trp90 indole (Fig. 3A). We will
refer to this location as site B. Interestingly, in the BCoV S1A
apo-structure, at the rim of site B, a sulfate ion is bound through
hydrogen bonding with Lys81 and Thr83. Its presence is of
considerable significance as oxoanions in apo-structures often are
indicative of and informative on interactions between the RBS and
ligand-associated carboxylate moieties (36-38). Thus, in
apo-structures of other Sia-binding proteins, sulfate and phosphate
anions were found to mimic the Sia carboxylate in topology and
sugar-protein hydrogen bonding (37, 38). Automated docking of 9-O,
5-N-Ac-Sia with the Sia carboxylate anchored at the position of the
sulfate ion showed that the ligand would fit into site B, with the
9-O-acetyl group, most critical for ligand recognition (15, 39),
docking into the narrower and deeper pocket of the two (Fig. 3B). The
5-N-acyl moiety would be accommodated by a hydrophobic patch within
the adjacent pocket, which is wide enough to also accept a
5-N-glycolyl group.
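
The geometric part of this signature, two hydrophobic pockets roughly 7 Å apart, can be expressed as a simple distance test. The sketch below assumes that each pocket is represented by a single centre point in ångström coordinates and that “about 7 Å” is read with a tolerance of 1.5 Å. The tolerance, the function names, and the example coordinates are illustrative assumptions and are not taken from the structures.

```python
import math

EXPECTED_SPACING = 7.0  # approximate P1-P2 distance in Å, as stated in the text
TOLERANCE = 1.5         # assumed tolerance in Å, chosen for illustration only

def pocket_spacing(p1_center, p2_center):
    """Euclidean distance in Å between two pocket centres given as (x, y, z)."""
    return math.dist(p1_center, p2_center)

def matches_spacing(p1_center, p2_center,
                    expected=EXPECTED_SPACING, tolerance=TOLERANCE):
    """True if the pocket spacing lies within `tolerance` of `expected`."""
    return abs(pocket_spacing(p1_center, p2_center) - expected) <= tolerance

if __name__ == "__main__":
    # Hypothetical pocket centres, not coordinates from 4H14 or 6QFY.
    candidate_p1 = (0.0, 0.0, 0.0)
    candidate_p2 = (6.0, 3.0, 2.0)
    print(pocket_spacing(candidate_p1, candidate_p2))   # 7.0
    print(matches_spacing(candidate_p1, candidate_p2))  # True
```

A spacing test of this kind only screens candidates. The presence of an intercalating aromatic side chain and a plausible carboxylate anchor, as described above, would still have to be checked separately.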

To explain the architecture of site B, SI Appendix, Fig. S1 presents
the overall structure of BCoV S1A and the designation of secondary
structure elements. Central to S1A is a β-sandwich scaffold, with one
face packing against a separate domain containing the C-terminal
region and the other covered by loop excursions that form a
topologically distinct layer. The putative binding site is located
within this layer at an open end of the β-sandwich with at its heart
the β5-3₁₀1 region (residues 80-95). Leu®’, Leu®®, Trp90, Phe”, and
Phe” form a well-packed hydrophobic core that on one side interacts
with the underlying β-sandwich. On the other side, along the protein
surface, it is wrapped by residues from the N-terminal loop
1-β1-loop 2 segment (L1β1L2) to form site B pocket P1. L1β1L2 is
locked in position through disulfide bonding of Cys” and β12-residue
Cys’, and through extensive intersegment hydrogen bonding with
β5-3₁₀1 residues, involving Ser” and Asn” via main-chain interactions
with Leu and Arg”, Val” via main-chain-side-chain interaction with
Ser®’, and Thr" via side-chain-side-chain interaction with Trp90.

Site B is conserved in all three β1CoVs and, from S cryo-EM
structures obtained for other lineage A betacoronaviruses (26, 27),
should be readily accessible also in the context of the fully folded
intact spike (SI Appendix, Fig. S3). In the PHEV and BCoV S1A
crystals (17), however, site B (but notably not site A) is occluded
by packing contacts, in the case of BCoV S1A even via coincidental
intermonomeric site B-site B interactions (SI Appendix, Figs. S2 and
S4). Thus, the rapid disintegration of PHEV S1A crystals during
soaking may well be explained by disruption of crystal contacts as a
result of ligand binding.
Fig. 3. Evidence from comparative structural analysis and
structure-guided mutagenesis to suggest that the S1A RBS locates at
site B. (A) Close-up of BCoV S1A site B in surface representation
with hydrophobic pockets P1 and P2 indicated, and with side chains of
residues, proposedly involved in receptor-binding, shown in sticks
and colored by element. A sulfate ion bound to site B in the BCoV S1A
apo structure (PDB ID code 4H14) is also shown (oxygen, red;
nitrogen, blue; carbons, gray; sulfur, yellow). Hydrogen bonds
between the sulfate ion and site B residues Lys81 and Thr83 are shown
as purple dashed lines. (B) Cartoon representation of BCoV S1A in
complex with 9-O-Ac-Sia as modeled by automated molecular docking
with Autodock4 (22). Residues predicted to be involved in
ligand-binding are shown in stick representation and colored as in A.
Sia-9-O-Ac is also shown in sticks, but with carbon atoms colored
cyan. Predicted hydrogen bonds between Lys81 and Thr83 side chains
and the Sia carboxylate moiety, and between the Lys81 main chain and
the Sia-5-N-Ac moiety are shown as purple dashed lines. (C)
Substitutions of site B residues in PHEV and OC43 S1A-Fc result in
complete loss of binding as detected by sp-LBA, performed as in Fig.
2B. (D, Upper) Substitutions of site B residues in PHEV and OC43
S1A-Fc result in substantial to complete loss of binding as measured
by conventional HAA at 4 °C. (Lower) Mutations, resulting in residual
binding, render the S1A RBS thermolabile. HAA shown (Upper) after a
temperature shift-up to 21 °C and continued 16-h incubation. (E)
Substitution of proposed ligand contacting residues Trp90, Lys81, and
Thr83/Ser83 results in total loss of detectable binding of BCoV, OC43
and PHEV S1A-Fc even when assayed by high-sensitivity nanoparticle
HAA. Assays performed with S1A-Fc protein multivalently presented on
60-meric protein A-lumazine synthase icosahedral shells (pA-LS) as in
ref. 40.
Mutational Analysis of β1CoV S1A Site B. To test whether site B
represents the true RBS, we again performed structure-guided
mutagenesis of PHEV and OC43 S1A (Fig. 3). Of note, the mutant
proteins tested were expressed and secreted to wild-type levels,
arguing against gross folding defects (for a quantitative comparison
of the expression levels of key mutant proteins and their melting
curves, see SI Appendix, Fig. S5). From the docking model, Trp90,
Lys81, and Thr83 (Ser83 in OC43 S1A) are predicted to be critically
involved in ligand binding. In accordance, their replacement by Ala
led to complete loss of detectable binding by sp-LBA (Fig. 3C). Upon
substitution of Trp90 by Ala, binding was no longer observed for
either PHEV or OC43 S1A, not even by HAA, be it conventional or
high-sensitivity nanoparticle (NP-)HAA (40) (Fig. 3 D and E).
Substitution of either Lys81 or Thr83 in PHEV S1A led to loss of
detectable binding also as measured by conventional HAA at 4 °C, but
for OC43 S1A-Ser83Ala and OC43 S1A-Lys81Ala residual binding was
detected. Notably, HA by OC43 S1A-Lys81Ala fully resolved after a
shift-up to room temperature, indicating that the mutation renders
the receptor-ligand interaction thermolabile and that binding
affinity is more severely affected under physiological conditions
(Fig. 3D). Importantly, combined substitution of Lys81 and Ser83 in
OC43 S1A resulted in complete loss of detectable binding even by
NP-HAA (Fig. 3E). Also, Ser®’Ala substitution, which disrupts the
hydrogen bond with Val? and thereby weakens the association between
L1β1L2 and β5-3₁₀1, caused loss of binding as detected by sp-LBA, and
a considerable decrease in binding as measured by HAA. Similar
results were obtained upon substitution of Leu®° and Leu®®, which
line the proposed P1 and P2 pockets, respectively (Fig. 3). Finally,
replacement of L1β1L2 residue Thr*', thus abolishing the hydrogen
bond with the β5-3₁₀1 residue Trp90, reduced binding affinity in
sp-LBA by more than 1,000-fold, as demonstrated for OC43 S1A (SI
Appendix, Fig. S6).

To further test our model, we asked whether we might also identify
gain-of-function mutations (i.e., substitutions within or proximal to
site B that would increase S1A binding affinity). Among the three
β1CoVs, the proposed RBS itself is highly conserved. OC43 and BCoV
S1A, for example, differ at site B only at position 83, which is
either a Thr (in BCoV) or Ser (in OC43). Despite this near identity,
BCoV S1A is the stronger binder of the two as indicated by a
consistent 32-fold difference in binding efficiency as measured by
sp-LBA (Fig. 1C). The difference at position 83 is particularly
intriguing, as this residue is predicted to be involved in ligand
binding through hydrogen bond formation with the Sia carboxylate
(Fig. 3B). Indeed, Ser83Thr substitution in OC43 S1A resulted in a
profound increase in binding affinity almost to that of BCoV S1A
(Fig. 4A).

Fig. 4. Increased RBS binding affinity of OC43 and PHEV S1A-Fc upon
replacement of site B residues by BCoV orthologs. (A) Comparison of
binding affinity of OC43 S1A-Ser83Thr to that of parental OC43 S1A
and BCoV S1A by sp-LBA and conventional HAA. (B) Comparison of
binding affinity of PHEV S1A mutants (TSR, N22T, S24V individually,
and N22T and S24V combined) to that of parental PHEV S1A and BCoV S1A
by sp-LBA and conventional HAA. Assays performed and results shown as
in Fig. 1. (C) Differences in PHEV and BCoV S1A binding affinity
correlate with amino acid variations in loop L1 and the resulting
changes in an intricate site B-organizing hydrogen bonding network.
Side view of S1A site B in BCoV (Left) and PHEV (Right) in combined
cartoon and sticks representation to highlight differences in intra-
and intersegment hydrogen bonds that fix the central site B β5-3₁₀1
segment through interactions with loop L1 and with the 3₁₀2-β12′ loop
(SI Appendix, Fig. S1). Note the pivotal role of the 3₁₀2-β12′ loop
Arg'®° side chain bridging the L1 and β5β6 loops through multiple
interactions with main chain carbonyls (depicted in sticks), and the
difference between BCoV and PHEV S1A in intraloop L1 hydrogen bonding
between residues 24 and 26.

Comparative sequence analysis of BCoV and PHEV S1A revealed several
amino acid differences proximal to site B, among which are Thr®*Arg,
Thr22Asn, and Val24Ser, that would alter hydrogen bonding within L1
and between L1 and β5-3₁₀1 (Fig. 4 and SI Appendix, Fig. S1).
Moreover, the latter two together create an additional
N-glycosylation site in PHEV S1A. To test whether and how these
differences affect receptor binding, PHEV residues were replaced by
their BCoV orthologs (Fig. 4 B and C). The mutations again led to
gain-of-function. Thr**Arg substitution, disrupting a Thr**-Asp**
hydrogen bond absent in BCoV S1A, resulted in a consistent twofold
increase in binding affinity as measured by sp-LBA. Asn22Thr
substitution, eliminating the glycosylation site in PHEV S1A, raised
binding affinity even 50-fold. However, Ser24Val substitution, also
destroying the glycosylation site, did not have an effect in
isolation. Yet, Ser24Val and Asn22Thr in combination raised PHEV
S1A-binding affinity by a further 30-fold almost to that of BCoV
(Fig. 4B). Apparently, the difference in S1A-binding affinity between
PHEV and BCoV can be ascribed to the architecture of site B as
determined by the L1-β5-3₁₀1 hydrogen bonding network rather than to
the absence or presence of a glycan at position 22 (Fig. 4B). Changes
in S1A L1-β5-3₁₀1 association may alter pocket P1 and thereby affect
ligand binding (SI Appendix, Fig. S7; note the difference in the
topology of Leu®® and Trp90 side chains in PHEV and BCoV S1A).
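
The statement that the double mutant approaches BCoV affinity can be checked with simple arithmetic on the quoted fold changes. The sketch below assumes that the “further 30-fold” gain applies on top of the 50-fold gain of Asn22Thr alone, so that the two multiply, and it compares the product with the ~1,450-fold deficit of wild-type PHEV S1A relative to BCoV. That reading, the function names, and the use of a log-scale ratio are illustrative assumptions; the fold values themselves are approximate.

```python
import math

def combine_folds(*folds):
    """Successive fold changes multiply."""
    return math.prod(folds)

def fraction_of_gap_closed(gain, gap):
    """Fraction of a fold gap that a fold gain recovers, on a log scale."""
    return math.log(gain) / math.log(gap)

if __name__ == "__main__":
    asn22thr_alone = 50      # Asn22Thr versus wild-type PHEV S1A
    further_combined = 30    # double mutant versus Asn22Thr (assumed reading)
    phev_deficit = 1450      # wild-type PHEV versus BCoV S1A

    total_gain = combine_folds(asn22thr_alone, further_combined)
    print(total_gain)  # 1500
    print(fraction_of_gap_closed(total_gain, phev_deficit))
```

Under these assumptions the combined gain of about 1,500-fold is of the same order as the ~1,450-fold deficit, which is consistent with the double mutant binding almost as well as BCoV S1A.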

One remaining question is how Tyr162, Glu182, Trp184, and His185 in
site A affect the RBS and why their substitution reduces
ligand-binding affinity, albeit modestly compared with mutations
within site B. Within the structure the sites are spaced relatively
closely together and indirect effects of site A mutations can be
envisaged. One possible explanation is the location of strand β12′,
comprising Glu182 and Trp184, right next to the first residues of L1.
As we show, mutation of L1 residues Asn”, Val“, Thr”, or of residues
that interact with L1 like Ser?”, has a drastic effect on ligand
binding. Through a similar mechanism, mutations that destabilize
strand β12′ may in turn affect L1 and ligand binding at site B.

In summary, our analyses revealed both loss-of-function and
gain-of-function mutations within or in close proximity of site B,
each of which is in accord with our model for S1A binding of
9-O-Ac-Sia in β1CoVs. Moreover, they provide a plausible explanation
for
Fig. 5. Mutations in site B but not site A reduce infectivity of BCoV
S-pseudotyped VSV particles. (A) Schematic outline of the production
of BCoV S-pseudotyped VSV-ΔG/Fluc and of the infection experiments.
(B) Protein composition of (mock) pseudotyped VSV particles. Western
blot analysis of purified and concentrated virus particles, secreted
by mock-transfected, VSVΔG/Fluc-transduced cells (m) or by transduced
cells, transfected to transiently express Flag-tagged BCoV strain
Mebus S (wild-type) or mutants thereof. BCoV S was either expressed
exclusively (− Endo HE-Fc; Upper) or coexpressed with HE-Fc (+ Endo
HE-Fc; Lower). Arrowheads indicate noncleaved S (S), the C-terminal
subunit S2 resulting from intracellular proteolytic cleavage of S
(S2), and the N protein of VSV. Note that coexpression of HE-Fc
promotes cleavage of BCoV S, presumably by preventing inadvertent
receptor association during S biosynthesis. Also note that in the
absence of endogenous HE-Fc, site B mutants are cleaved to extents
inversely related to RBS affinity. (C) Infectivity assays with BCoV
S-pseudotyped viruses. Virus particles, pseudotyped with BCoV S or
mutants thereof and produced in the presence or absence of
intracellular HE-Fc


6 of 10 | www.pnas.org/cgi/doi/10.1073/pnas.1809667116 


the modest yet reproducible loss of binding affinity upon sub- 
stitution of residues previously identified by Peng et al. (17). The 
results therefore lead us to conclude that site B identified here 
represents the true S1^ RBS. 


Site B Is Essential for Infectivity. To directly test the relevance of 
sites A and B for infectivity, we pseudotyped G protein-deficient, 
luciferase-expressing recombinant vesicular stomatitis virus 
(VSV) with Flag-tagged BCoV S or with mutant derivatives (see 
Fig. 5A for a schematic outline of the experiment). Considering 
that for B1CoVs HE is an essential protein (10, 16), we expressed 
wild-type BCoV S either exclusively or together with a secretory 
HE-Fc fusion protein. Because HE-Fc is not membrane- 
associated, it will not be included in the VSV envelope, but 
would deplete endogenous 9-O-Ac-Sia pools in the exocytotic 
compartment and from the cell surface. Thus, virus release would 
be facilitated and inadvertent receptor association during S bio- 
synthesis and receptor-mediated virion aggregation would be 
prevented. Wild-type S in the absence of HE-Fc was incorporated 
in VSV particles in its noncleaved 180K form (Fig. 5B, Upper). 
However, the resulting particles—purified and concentrated from the 
tissue culture supernatant by sucrose cushion ultracentrifugation— 
were noninfectious as measured by luciferase assay (Fig. 5C). 
VSV particles, produced in cells that coexpressed S and HE-Fc, 
carried spikes that resembled those described for BCoV and 
OC43 virions in that a large proportion had been proteolytically 
cleaved into subunits S1 and S2 (Fig. 5B, Lower) (41, 42). More- 
over, these particles were infectious. Values of relative light units 
(RLUs) detected in inoculated cells were more than 40-fold above 
background. Strikingly, their infectivity was increased by a further 
5.7-fold upon addition of “exogenous” HE-Fc to the inoculum 
(Fig. 5 A and C). Hence, these conditions were chosen to deter- 
mine the effects of mutations in BCoV S. Site B mutations caused 
complete loss of infectivity (Trp’’Ala), or significantly reduced 
infectivity to extents correlating with the effect of each mutation 
on S binding affinity (Trp’’Ala > Lys*'Ala > Thr”Ala) (Figs. 3 
and 5C). Particles pseudotyped with site A mutants were as in- 
fectious as S*'-VSV. The combined data reinforce our conclusion 
that site B is the RBS and provide direct evidence that this site is 
essential for BCoV infectivity. 


HCoV-HKU1 also Binds to 9-0-Ac-Sia via S1^ Site B. HKU1 is a 
lineage A betacoronavirus but separated from the B1CoVs by a 
considerable evolutionary distance. To illustrate, 1° of HKU1 is 
only 55-60% identical to those of BCoV and OC43. However, 
HKU1, like the B1CoVs, uses 9-O-Ac-sialoglycans for attachment 
and the binding to these receptors is essential for infection (14). 
According to recent publications, the RBS would not be in $1“, but 
in a downstream domain (31, 32), the main argument being that 
HKU1 S1“ does not detectably bind to glycoconjugates (30). In- 
deed, we also did not detect binding of HKU1 S1“_Fe, at least not 
by standard sp-LBA and conventional HAA (Fig. 6 A and E). 
However, when HKU1 S1“—Fc was tested by high-sensitivity NP- 
HAA with multivalent presentation of the HKU1 S1^-Fc proteins, 
agglutination of rat erythrocytes was observed (Fig. 64). Moreover, 
depletion of cell surface 9-O-Ac-Sia receptors by pretreatment of 
the erythrocytes with BCoV HE completely prevented HA (Fig. 
6A). These findings conclusively demonstrate that, in fact, HKU1 


(endo HE) were used to inoculate HRT18 cells, either with (+) or without (—) 
exogenous HE (exo HE) added to the inoculum. “Infectivity” is expressed in 
RLUs detected in lysates from inoculated cells at 18 h after inoculation. RLU 
values were normalized with those measured for VSV-S“t set at 100%. The 
data shown are averages from three independent experiments, each of which 
performed with technical triplicates. SDs and significant differences, calculated 
by Welch’s unequal variances t test, are indicated (**P < 0.01; ***P < 0.001, 
****P < 0.0001). 
S1A does bind to 9-O-Ac-Sia in a 9-O-Ac-dependent fashion,
apparently through low-affinity high-specificity interactions. The
detection of such interactions critically depends on multivalency of
receptors and receptor-binding proteins (10, 40), and thus they are
easily missed. This requirement for multivalency may also explain
why preincubation of cells with soluble S1A-Fc fusion proteins did
not block HKU1 infection (32).

Fig. 6. HKU1 S1A site B mediates binding to 9-O-Ac-Sias. (A) HKU1
S1A-Fc binds to 9-O-Ac-Sia through low-affinity, high-specificity
interactions. (Upper) Conventional HAA (top row) and high-sensitivity
pA-LS NP-HAA (bottom rows) with native HKU1 S1A-Fc (HKU1) or mutants
thereof (Lys80Ala, Thr82Ala, Trp89Ala). Noncomplexed nanoparticles
were included as negative control (−). (Lower) NP-HAA with
erythrocytes (mock)depleted for 9-O-Ac-Sias by prior
sialate-O-acetylesterase treatment. (B) Surface representation of
site B in BCoV (Top) and HKU1 (Middle) [PDB ID code 5I08 (27)],
respectively. Protein structures were aligned with site B residues
Lys81, Thr83, and Trp90 as query. Side chains of site B residues are
shown in sticks. BCoV residues 29-34 (element e1) and 246-253 (e2),
and corresponding elements in HKU1, are indicated in orange. (Bottom)
Overlay of cartoon representations of BCoV (purple) and HKU1 (white)
S1A centered on site B. In the HKU1 protein, e1 and e2 are colored
orange and marked. Side chains of Asn29 and Asn251, acceptors of
N-linked glycosylation, are indicated in sticks. BCoV Trp90 and its
HKU1 ortholog, also in sticks, are shown as a reference point to site
B. (C, Upper) Replacement of HKU1 S1A e1 and e2 by corresponding BCoV
elements increases binding affinity to 9-O-Ac-sialoglycans present on
rat erythrocytes. NP-HAA with native HKU1 S1A-Fc (HKU1) or with HKU1
S1A-Fc derivatives, with e1 and e2 replaced by the corresponding BCoV
elements separately (B-e1, B-e2) or in combination (B-e1+e2). For
comparison, mutant HKU1 S1A-Fc proteins were included with
N-glycosylation sites eliminated in site A (N171Q) or in elements e1
(N29Q) and e2 (N251Q), individually and in combination (N29Q + N251Q).
(C, Lower) NP-HAA with erythrocytes (mock)depleted for 9-O-Ac-Sias.
(D) Replacement of HKU1 S1A e1 and e2 by corresponding BCoV elements
increases binding affinity to 9-O-Ac-sialoglycans as to allow
detection by conventional HAA. (E) Replacement of HKU1 S1A e1 and e2
increases binding affinity as to allow detection of
9-O-Ac-sialoglycans by nanoparticle sp-LBA. (Left) Conventional sp-LBA
with soluble S1A-Fc. (Right) Nanoparticle sp-LBA with S1A-Fc complexed
to pA-LS icosahedral shells in twofold serial dilutions, starting at
50 nmol pA-LS per well. Data points are means of independent duplicate
experiments, each performed in triplicate.

Interestingly, the RBS identified for β1CoV S1A is conserved
in HKU1 in sequence and structure (Fig. 6B and SI Appendix,
Figs. S1B and S8), raising the question as to whether binding of
HKU1 S to its sialoglycan receptor occurs through the same site.
Indeed, Ala substitution of HKU1 S1A Lys80, Thr82, or Trp89
(orthologs of Lys81, Thr83, and Trp90 in BCoV S1A) resulted in a
complete loss of detectable binding (Fig. 6A; see SI Appendix,
Fig. S9 for an analysis in the context of S1-Fc). To further study
whether this site is involved in 9-O-Ac-Sia binding and to
understand the structural basis for the considerable difference in
affinity between HKU1 and β1CoV S1A domains, we performed
a side-by-side structure comparison. While in β1CoV S1A domains
the RBS is readily accessible, the corresponding site in
HKU1 S1A is located at the bottom of a canyon because it is
flanked by protruding parallel ridges composed of HKU1 S1A
residues 28 through 34 [element (e)1 corresponding to BCoV 
L262 residues 29-35], and 243 through 252 (e2 corresponding 
to BCoV residues 253-256 in the β18-β19 loop), each decorated
with an N-linked glycan (Fig. 6B; see also SI Appendix, Fig. S1B)
(27). Conceivably, this assembly would hamper binding,
particularly to short sialoglycans such as the O-linked STn sugars
(Sia-α2,6GalNAc-α1-O-Ser/Thr) predominantly present on BSM
(43). Individual exchange of e1 and e2 in HKU1 S1A by the
corresponding segments from BCoV indeed increased receptor
binding as measured by NP-HAA by 16- or 256-fold for e1 and e2,
respectively (Fig. 6C). Exchange of e2 in fact enhanced receptor
affinity to levels that now also allowed detection of binding by
conventional HAA, while simultaneous replacement of both elements
increased binding affinity even further (Fig. 6D). Importantly,
pretreatment of the erythrocytes with BCoV HE to deplete receptors
by Sia-de-O-acetylation prevented hemagglutination (Fig. 6C),
indicating that the enhanced binding was still 9-O-Ac-Sia-dependent.
Replacement of e2 increased receptor affinity to such an extent that
binding to BSM by sp-LBA also became detectable, albeit
only with nanoparticle-bound and not with free S1A (Fig. 6E). 
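
Because the hemagglutination assays are read from twofold serial dilutions, the fold increases quoted above correspond to whole-well shifts of the agglutination endpoint. The following Python sketch merely illustrates that arithmetic; the function names are hypothetical and are not part of the original analysis.

```python
import math

def fold_from_well_shift(wells: int, dilution_factor: float = 2.0) -> float:
    """Fold change in titer for an endpoint shift of `wells` dilution steps."""
    return dilution_factor ** wells

def well_shift_from_fold(fold: float, dilution_factor: float = 2.0) -> float:
    """Number of dilution steps that corresponds to a given fold change."""
    return math.log(fold) / math.log(dilution_factor)

# Fold increases reported for the individual e1 and e2 exchanges.
for label, fold in (("e1", 16), ("e2", 256)):
    print(label, round(well_shift_from_fold(fold)), "wells")
```

Under this reading, the 16-fold and 256-fold gains amount to endpoint shifts of four and eight wells, respectively.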

To study whether the N-glycans attached to e1 and e2
hinder receptor binding and thus contribute to the apparent
low affinity of HKU1 S1A, we expressed the protein in
N-acetylglucosaminyltransferase I-deficient HEK293S GnTI− cells
(44). Replacement of complex N-glycans by high-mannose sugars
resulted in an eightfold increase in binding affinity as measured
by NP-HAA (SI Appendix, Fig. S9). In agreement, combined
disruption of the N-linked glycosylation sites in e1 and e2 through
Gln substituting for Asn29 and Asn251 increased binding affinity to
a similar extent, with the latter mutation exerting the largest effect
(Fig. 6C). The deletion of glycosylation sites, however, did not
enhance affinity to that of the HKU1-BCoV S1A/e1+e2 chimera,
indicating that receptor binding is hampered not only by the
N-linked glycans, but also by the local protein architecture. Of note,
removal of the glycan attached to Asn171, proximal to the binding
site originally predicted by Peng et al. (17), did not enhance but
actually lowered the binding affinity of HKU1 S1A (Fig. 6C),
possibly by affecting protein folding. 

In summary, our results provide direct evidence that HKU1
binds to 9-O-Ac-Sias via S domain A. The data again argue
against a role for site A (17), and on the basis of loss- and
gain-of-function mutations strongly indicate that HKU1 binds its
sialoglycan receptor via site B here identified for β1CoVs. On a
critical note, we are aware that in the published HKU1 S
apo-structure, the orientation of the Lys80 side chain differs from
that in BCoV S1A such that hydrogen bonding with the Sia carboxylate
would be precluded. Whether this is an inaccuracy in the structure,
whether the orientation of the side chain changes during ligand
binding, or whether this is indeed a factual difference between
HKU1 and β1CoV RBSs remains to be seen. Be that as it may, our
findings do imply that in two human coronaviruses, related yet
separated by a considerable evolutionary distance, the RBS and
general mode of receptor binding have been conserved. It is
tempting to speculate that the apparently low affinity of HKU1
S1A is a virus-specific adaptation to replication in the human tract.
Conceivably, occlusion of the HKU1 S RBS by e1 and e2 might
allow selective high-affinity binding to particular sialoglycans with
the 9-O-Ac-Sia terminally linked to extended glycan chains as to
prevent nonproductive binding to short stubby sugars, such as
present on BSM and other mucins, that otherwise would act as
decoys and inhibitors. In analogy, influenza A H3N2 variants and
2009 pandemic H1N1 (Cal/04), initially assumed to carry
low-affinity HAs, were recently reported to have evolved a preference
for a subset of human-type α2,6-linked sialoglycan-based
receptors comprising branched sugars with extended
poly-N-acetyl-lactosamine (poly-LacNAc) chains (45). Whether such an
adaptation would translate into distinctive differences between
OC43 and HKU1 with respect to dynamic virion-receptor
interactions and cell tropism merits further study. 


Materials and Methods 


Protein Design, Expression, and Purification. The 5′-terminal sequences of the
S genes of BCoV strain Mebus (GenBank: P15777.1), OC43 strain ATCC
VR-759 (GenBank: AAT84354.1), PHEV strain UU (GenBank: ASB17086.1),
HKU1 strain Caen1 (GenBank: ADN03339.1), and mouse hepatitis virus
(MHV) strain A59 (GenBank: P11224.2), encoding the signal peptide and
adjacent S1A domain, were cloned in expression vector pCAGGS-Tx-Fc (46).
Domain A, defined on the basis of the cryo-EM structure of the MHV-A59 S
ectodomain (26), corresponds to S amino acid residues 15-294, 15-298, and
15-294 of BCoV, OC43, and PHEV, respectively. The amino acid sequences of
these S1A-Fc fusion proteins were deposited in GenBank (MG999832-35).
Recombinant S1A proteins, genetically fused to the Fc domain of human
IgG, were purified by protein A affinity chromatography from the
supernatants of transiently transfected HEK293T cells as in Zeng et al. (18). 
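
The residue ranges given above for domain A are 1-based and inclusive, which is easy to get wrong when slicing sequences programmatically. The Python sketch below shows one way to apply them; the dictionary and helper names are hypothetical, and the toy sequence is a placeholder rather than real spike sequence data.

```python
# 1-based, inclusive domain A residue ranges as stated in the text.
DOMAIN_A = {"BCoV": (15, 294), "OC43": (15, 298), "PHEV": (15, 294)}

def extract_domain(seq: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein sequence."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError("residue range lies outside the sequence")
    return seq[start - 1:end]

def domain_length(virus: str) -> int:
    """Number of residues in domain A for the given virus."""
    start, end = DOMAIN_A[virus]
    return end - start + 1

# Toy example: residues 3-5 of a ten-residue placeholder string.
print(extract_domain("ABCDEFGHIJ", 3, 5))
print({virus: domain_length(virus) for virus in DOMAIN_A})
```

With these ranges, domain A spans 280 residues in BCoV and PHEV and 284 residues in OC43.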


HAA. Standard HAA was performed as in Zeng et al. (18). Twofold serial
dilutions of CoV S1A-Fc proteins (starting at 2.5 µg per well; 50 µL per well)
were prepared in V-shaped 96-well microtiter plates (Greiner Bio-One), to
which was added 50 µL of a rat erythrocyte suspension (Rattus norvegicus strain
Wistar; 0.5% in PBS). High-sensitivity NP-HAA was performed as in Li et al. (40).
In brief, S1A-Fc proteins were complexed with pA-LS (a self-assembling
60-meric lumazine synthase nanoparticle, N-terminally extended with the Ig
Fc-binding domain of the Staphylococcus aureus protein A) at a 0.6:1 molar ratio
for 30 min on ice, after which complexed proteins were twofold serially
diluted and mixed 1:1 with rat erythrocytes (0.5% in PBS). HA was assessed after
2-h incubation on ice unless stated otherwise. For specific depletion of
cell-surface O-Ac-Sias, erythrocytes (50% in PBS pH 8.0) were (mock-)treated with
soluble HE+ (0.25 µg/µL BCoV-Mebus and 0.25 µg/µL PToV-P4) for 4 h at 37 °C,
comparable to the procedure described in Langereis et al. (47). 
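
The twofold dilution scheme and the endpoint read-out of an HAA can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names are hypothetical, the number of wells per row is an arbitrary choice, and the titer is taken as the reciprocal of the highest dilution that still agglutinates, with the first well counted as undiluted.

```python
def serial_dilution(start_amount: float, n_wells: int = 12, factor: float = 2.0):
    """Protein amount per well for a serial dilution row (units as supplied)."""
    return [start_amount / factor ** i for i in range(n_wells)]

def ha_titer(agglutinated, factor: float = 2.0) -> float:
    """Reciprocal of the highest dilution that still agglutinates (0 if none)."""
    last_positive = -1
    for index, positive in enumerate(agglutinated):
        if not positive:
            break
        last_positive = index
    return 0.0 if last_positive < 0 else factor ** last_positive

# Hypothetical row: starting amount 2.5, first three wells agglutinate.
print(serial_dilution(2.5, n_wells=4))
print(ha_titer([True, True, True, False]))
```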


Sp-LBA. The 96-well Maxisorp microtitre ELISA plates (Nunc) were coated with
0.1 µg BSM (Sigma-Aldrich) per well. Conventional sp-LBAs were performed
using twofold serial dilutions of CoV S1A-Fc proteins, as described previously
(20, 47). In nanoparticle sp-LBA experiments, BSM-bound nanoparticles were
detected with StrepMAb-HRP via the C-terminally appended Strep-tag of
the pA-LS proteins, as described previously (40). Receptor-depletion assays
were performed by (mock-)treatment of coated BSM with twofold serial
dilutions of soluble HE+ in PBS (100 µL per well) starting at 0.6 ng/µL
(BCoV-Mebus) for 2 h at 37 °C, as in Langereis et al. (47). Receptor destruction
was assessed by sp-LBA with β1CoV S1A-Fc at fixed concentrations (BCoV
0.3 ng/µL; OC43 1.2 ng/µL). 


Glycoside Synthesis and NMR Analysis. The starting material Neu5Ac2αMe (48)
was treated with p-toluenesulfonyl chloride (1.8 eq.) in pyridine overnight
and then with sodium azide (5 eq.) in DMF at 70 °C. The resulting
intermediate was subjected to Staudinger reduction by 1 M trimethylphosphine
in toluene (3 eq.) in the presence of potassium hydroxide. Upon
completion, acetyl chloride (4 eq.) was added directly. After 5 min, 1 M
potassium hydroxide solution was added to quench the reaction. The mixture
was neutralized with acidic resin. The residue was purified with silica gel
and then P-2 Bio-Gel to give pure Neu5,9NAc₂2αMe at a yield of 31% over
four steps. The product was dissolved in 50 mM ammonium bicarbonate and
freeze-dried, which was repeated three times to give the ammonium salt
form. The final product was analyzed by NMR spectroscopy (SI Appendix,
Fig. S10). 
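
To put the overall yield in perspective, the geometric-mean yield per step can be computed from the overall yield and the number of steps. The Python sketch below assumes, purely for illustration, that all four steps contribute equally; the individual step yields are not reported, and the function name is hypothetical.

```python
def mean_step_yield(overall_yield: float, n_steps: int) -> float:
    """Geometric-mean yield per step for a given overall yield."""
    return overall_yield ** (1.0 / n_steps)

# Overall yield of 31% over four steps, as stated in the text.
per_step = mean_step_yield(0.31, 4)
print(f"average yield per step: {per_step:.1%}")
```

An overall yield of 31% over four steps corresponds to roughly 75% per step on average.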


Crystallization. Crystallization conditions were screened by the sitting-drop
vapor-diffusion method using a Gryphon (Art Robbins). Drops were set up
with 0.15 µL of S1A dissolved to 11 mg/mL in 10 mM BisTris-propane, 50 mM
NaCl, pH 6.5, and 0.15-µL reservoir solution at room temperature. Diffracting
crystals were obtained from the JCSG+ screen [200 mM MgCl2, 100 mM Bis-Tris
pH 5.5, 25% (wt/vol) PEG3350] (JCSG Technologies) at 18 °C. Crystals were
flash-frozen in liquid nitrogen using reservoir solution with 30% (wt/vol)
glycerol as the cryoprotectant. Crystals were soaked with
methyl-5,9-di-N-acetyl-α-neuraminoside and
methyl-5-N-acetyl-4,9-di-O-acetyl-α-neuraminoside, as described, for the
latter ligand, with batches previously used to solve
HE holo-structures (18-20, 22). 


Data Collection and Structure Solution. Diffraction data of PHEV S1A crystals
were collected at European Synchrotron Radiation Facility station ID30A-3.
Diffraction data were processed using XDS (49) and scaled using Aimless
from the CCP4 suite (50). Molecular replacement was performed using
PHASER (51) with BCoV S1A residues 35-410 as template (PDB ID code 4H14).
NCS-restrained structure refinement was performed in REFMAC (52),
alternated with model building in Coot (53); water molecules were built in
difference density peaks of at least 5.0 σ located at buried sites conserved and
occupied by water in the 1.5-Å resolution structure of BCoV S1A. Molecular
graphics were generated with PYMOL (pymol.sourceforge.net). See Table 1
for X-ray data statistics for PHEV S1A. 
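
As a quick consistency check on the cell dimensions listed in Table 1, the unit-cell volume can be computed with the general triclinic formula. The Python sketch below is illustrative only and is not part of the original data processing; the function name is hypothetical.

```python
import math

def unit_cell_volume(a, b, c, alpha, beta, gamma):
    """Unit-cell volume (cubic angstroms) from edges (angstroms) and angles (degrees)."""
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return a * b * c * math.sqrt(1 - ca * ca - cb * cb - cg * cg + 2 * ca * cb * cg)

# Cell dimensions and angles as listed in Table 1.
volume = unit_cell_volume(112.57, 112.57, 141.44, 90, 90, 120)
print(f"unit-cell volume: {volume:.3e} cubic angstroms")
```

For these values the volume is about 1.55 million cubic angstroms.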


Table 1. X-ray data statistics for PHEV S1A

Data collection and refinement              Statistic

Data collection
  Wavelength, Å                             0.9677
  Space group                               P3,21
  Cell dimensions
    a, b, c, Å                              112.57, 112.57, 141.44
    α, β, γ, °                              90, 90, 120
  Resolution range, Å*                      97.5-3.0 (3.07-2.97)
  No. unique reflections                    20,445 (1,640)
  Redundancy                                7.3 (4.1)
  Completeness, %                           93.5 (79.9)
  Rmerge                                    0.154 (1.56)
  I/σI                                      14.0 (1.0)
  CC1/2                                     0.994 (0.462)†
Refinement
  Rwork/Rfree                               0.218/0.253
  No. molecules in the asymmetric unit      2
  No. atoms
    Protein                                 4,536
    Carbohydrate                            193
    Water                                   30
  Average B/Wilson B, Å²                    98.1/94.5
  RMS deviations
    Bond lengths, Å                         0.0078
    Bond angles, °                          1.33
    NCS-restrained atoms                    0.054
  Ramachandran plot; favored,
    allowed, outliers, %                    92.6, 6.9, 0.5

*Numbers between brackets refer to the outer resolution shell.
†Diffraction is highly anisotropic: CC1/2 in the outer resolution shell is
0.845 for reflections within 20° from c*, whereas it is 0.00 within 20° from
the a*b* plane.


Molecular Docking. Molecular docking of 9-O-Ac-Sia in the crystal structure of
apo-BCoV S1A (PDB ID code 4H14) was performed with AutoDock4 (22)
similar to the procedure described by Bakkers et al. (22). The ligand
molecule was extracted from BCoV HE (PDB ID code 3CL5). During docking, the
protein was considered to be rigid. An inverted Gaussian function (50-Å
half-width; 15-kJ energy at infinity) was used to restrain the Sia-carboxylate
near the position occupied by the sulfate ion in the BCoV S1A crystal
structure. The initial ligand conformation was randomly assigned, and 10 docking
runs were performed. 
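
The shape of an inverted Gaussian restraint can be illustrated with a short Python sketch. The exact functional form used by the docking software is not given in the text, so this version rests on an assumption: the half-width is taken as the distance at which the penalty reaches half of its plateau value. The function name is hypothetical.

```python
import math

def inverted_gaussian(distance: float, half_width: float = 50.0,
                      e_inf: float = 15.0) -> float:
    """Restraint penalty: zero at the target position, e_inf far away.

    Assumes `half_width` is the distance at which the penalty equals e_inf / 2.
    """
    return e_inf * (1.0 - math.exp(-math.log(2.0) * (distance / half_width) ** 2))

# Penalty at the target, at the half-width, and far from the target.
for d in (0.0, 50.0, 1.0e4):
    print(d, round(inverted_gaussian(d), 3))
```

A restraint of this shape penalizes poses that drift away from the reference position while leaving the ligand free to adjust close to it.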


Preparation of BCoV S-Pseudotyped VSV Particles and Infection Experiments. 
Recombinant G protein-deficient VSV particles were pseudotyped as de- 
scribed previously (54), with the S protein of BCoV strain Mebus. To allow 
transport of S to the cell surface and its incorporation into VSV particles, a 
truncated version of S was used from which the C-terminal 17 residues, 
comprising an endoplasmic reticulum-retention signal, had been removed. 
To facilitate detection, the S protein was provided with a Flag-tag by 
cloning its gene in expression vector pCAGGS-Flag. HEK 293T cells at 70% 
confluency were transfected with PEI-complexed plasmid DNA, as
described previously (18). For coexpression of BCoV S and HE-Fc, S expression 
vectors and pCD5-BCoV HE-Fc (18) were mixed at molar ratios of 8:1. At 
48 h after transfection, cells were transduced with VSV-G-pseudotyped 
VSVΔG/Fluc (54) at a multiplicity of infection of 1. Cell-free supernatants 
were harvested at 24 h after transduction, filtered through 0.45-µm 
membranes, and virus particles were purified and concentrated by sucrose 
cushion ultracentrifugation at approximately 100,000 × g for 3 h (10). 
Pelleted virions were resuspended in PBS and stored at −80 °C until further 
use. Relative virion yields were determined on the basis of VSV-N content 
by Western blot analysis by using anti-VSV-N monoclonal antibody 10G4 
(Kerafast). Uptake of BCoV S into virus particles was detected by Western 
blot analysis with monoclonal antibody ANTI-FLAG M2 (Sigma). In- 
oculation of HRT18 monolayers in 96-well cluster format was performed 
with equal amounts of S-pseudotyped VSVs, as calculated from VSV-N 
content (roughly corresponding to the yield from 2 x 10° cells from each 
transfected and transduced culture), diluted in 10% FBS-supplemented 
DMEM. For virus infections with “exogenous” HE, 12.5 mU/mL of BCoV 
HE-Fc protein was added to the inoculum. At 18 h postinfection, cells were 
lysed using passive lysis buffer (Promega). Firefly luciferase expression was 
measured using a homemade firefly luciferase assay system, as described 
previously (55). Infection experiments were performed independently in 
triplicate, each time with three technical replicates. 
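
The normalization of luciferase readings to wild type and the Welch test used for the infectivity data can be sketched in plain Python. This is a minimal illustration, not the authors' analysis script: the function names and the example values are hypothetical, and only the test statistic and the Welch-Satterthwaite degrees of freedom are computed (a P value would additionally require a t distribution).

```python
from math import sqrt
from statistics import mean, variance

def percent_of_wild_type(rlu: float, wt_rlu: float) -> float:
    """Express a luciferase reading as a percentage of the wild-type reading."""
    return 100.0 * rlu / wt_rlu

def welch_t(a, b):
    """Welch's unequal-variances t statistic and Welch-Satterthwaite df."""
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

# Hypothetical readings from three independent experiments per group.
group_a = [100.0, 110.0, 90.0]
group_b = [10.0, 12.0, 8.0]
t_stat, dof = welch_t(group_a, group_b)
print(f"t = {t_stat:.2f}, df = {dof:.2f}")
```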